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Alzheimer's disease causes severe neurodegeneration in the brain that leads to a certain death. 
The defining factor is the formation of extracellular senile amyloid plaques in the brain. However, 
therapeutic approaches to remove them have not been effective in humans, and so our understanding 
of the cause of Alzheimer's disease remains incomplete. Here we investigate physical processes that 
might relate to its onset. Instead of the extracellular amyloid, we scrutinize the intracellular domain 
of its precursor protein. We argue for a phenomenon that has never before been discussed in the 
context of polymer physics: Like ice and water together, the intracellular domain of the amyloid 
precursor protein forms a state of phase coexistence with another protein. This leads to an inherent 
instability that could well be among the missing pieces in the puzzle of Alzheimer's disease. 



The neurological origin of Alzheimer's disease involves both genetic and environmental factors PQ-[Sj. Its hallmark is 
the accumulation of the amyloid isoform A/342 into senile plaques [6]- [8]. This has motivated several immunothcrapic 
approaches to either clear or prevent the cerebral A/3 deposits [S]-[TT]. Unfortunately there are serious side-effects 
such as the development of aseptic meningoencephalitis |12j-[14j. As a consequence we do not know whether targeting 
of A/342 will cure or even curb the disease in humans. The excess production of A/342 might just be an indication 
that something else has gone wrong [T3] . [16) . 

The A/342 is a derivative of the transmembrane amyloid precursor protein (APP) by proteolytic cleavages [6], |17) . 
[18j . APP comes in several isoforms, it is naturally present in many organs. Its physiological function remains under 
a debate and the understanding of its proteolytic processing is also incomplete [T5], [T7], [IB]- Both the dominant, 
non-amyloidogenic pathway and the disease related, A/3 generating amyloidogenic pathway produce isoforms of the 
APP intracellular domain (AICD) [17], [T5]. We have scrutinized the physical properties of various AICD complexes, 
searching for an intracellular agent that might correlate with the onset of the anomalous A/342 production. We identify 
an inherently unstable physical phenomenon that has never before been discussed in the context of polymer research. 

After the j/e cleavage of APP the AICD may form a transcriptionally active state with the Fe65 family of nuclear 
multidomain adaptor proteins [17], [2D] -[22]. Even though the relation between AICD and Alzheimer's disease is not 
yet understood, we know that AICD is a product and Fe65 is a participant in the proteolytic cleavage processing of 
APP into A/342. Not surprisingly Fe65 already appears among the potential therapeutic targets [2D], [21], [TTj . 

In isolation AICD is presumed to be an intrinsically unstructured protein [23] . However, upon binding to Fe65, 
AICD can assume a regular form that can be analyzed with x-ray crystallography. Unfortunately, the high precision 
data remains limited. Here we shall investigate the structure with PDB code 3DXC (chain B) [23]. It describes a 
complex of a 28 residue segment of AICD with the larger, 65 residue host Fe65. There are also the closely related 
3DXD and 3DXE, these can be analyzed similarly and with identical conclusions. We find that the complex appears to 
have physical properties that seems to set it apart from all but a very few oligomers. It is an example of an apparently 
previously unrecorded but seemingly systematic phenomenon of protein (polymer) phase coexistence: Like ice with 
water the two proteins are in two different phases. As such, an oligomer that displays the rare and inherently unstable 
phenomenon of phase coexistence is for sure an interesting object for future research. But the delicate balance of the 
AICD/Fe65 complex has the supplementary potential of being an important piece in the puzzle to find a cure for 
Alzheimer's disease. 

The phases of a protein and more generally those of a polymer, are characterized by their fractal (Hausdorff) 
dimension. This is an order parameter that can be computed by inspecting the scaling properties of the radius of 
gyration R g . Asymptotically, in the limit where the number N of monomers becomes very large )25) 



Here Yi are the coordinates of the backbone C a , the pre- factor R Q is an effective inter-monomer distance that is 
independent of N, and v is the compactness index that equals the inverse fractal dimension of the backbone. The 
remarkable property of ([!]) is that v is a universal quantity [25], |26) . Different values of v correspond to different 
phases, and once we know Rq we can unanimously compute the radius of gyration by simply counting the number of 
monomers. All the effects of temperature and chemical microstructure and all atomary level details of a polymer are 
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FIG. 1: a) The (N,R g ) distribution of all single chain PDB proteins with resolution less than 2.0 A and with less than 30% 
homology equivalence. The lower line is the Regge trajectory for mostly a- helical proteins and the top line is a v — 3/5 Flory 
line; there are practically no single chain entries above this line, b) All proteins currently in PDB that are above the Flory 
line. The two Regge trajectories (3) are clearly visible. 



contained in the value of the a priori non- universal and in principle computable pre- factor Rq. 

The relation ([I]) becomes truly precious only in those exceptional circumstances where Rq assumes no more than 
a small number of different values. We argue that this is indeed the case in proteins: When N increases, proteins 
become increasingly uniform in their chemical composition. Consequently it makes sense to employ to study their 
phase structure. Different, clearly identifiable trajectories that are labelled by the different well defined values of 
Rq are then the protein analogs of the Regge trajectories in high energy physics [27] , 

Proteins and other polymers |25) |26) have four major phases: Under physiological conditions and in other bad 
solvents a protein collapses into a space filling conformation with v s» 1/3. For a fully flexible chain we have the 
0-point value v « 1/2 while in the self-avoiding random walk phase we have the Flory value v « 3/5. Finally, when 
v « 1 the protein looses its inherently fractal structure and becomes like a one dimensional rigid rod. Examples of 
this phase are monotonous a-helices and /3-strands that have no additional twists, turns or loops. 

In Figure la we plot all individual single chain proteins in PDB that have resolution less than 2.0 A and homology 
equivalence which is less than 30%. With a few exceptions they assemble around a Regge trajectory with v ss 1/3. 
We also plot the dominant mostly-a-helical trajectory with Rq w 2.29 and v ~ 0.37; The numerical values are slightly 
different for the different protein subclasses like mostly-a-helical, mostly-/3-sheets etc. and this fine structure has been 
discussed in [28], |29j . 

When we extend our analysis to individual chains within oligomers we find two previously unobserved clearly visible 
Regge trajectories (Figure lb). These two trajectories are 

Rf « 0.48 • N - 973 & a 1.02 -TV - 94 (2) 

These trajectories both have v very close to one. Thus they must be in the same universality class, and the difference 
is a finite size effect. This is the universality class of one dimensional rods and sticks. Unlike the other three polymer 
phases, it has no fractal structure. 

(2) 

The trajectory R g includes several membrane proteins and viral capsomers, an example of the latter is 1AIK 
in PDB. The trajectory Rg is mainly populated by collagen proteins such as for example 2CUO in PDB. In both 
trajectories the oligomers commonly consist of several individual chains that are each located on the same Regge 
trajectory. Their mutual interactions provides a supportive lattice structure that protects the individual chains 
against a collapse into the v ps 1/3 phase. 

Remarkably, there are also protein complexes in the Regge trajectories of Figure lb that do not follow the structural 
pattern of collagens, membrane proteins or viral capsomers. In particular, we have found that there is a small number 
of oligomers that are composed of proteins on different Regge trajectories. These complexes consist of (host) sub-chains 
that are in a 0-point trajectory v ss 1/2 and (guest) sub-chains on a similarly uncollapsed i/«l trajectory of Figure 
lb. These oligomers are examples of a previously unrecorded physical phenomenon of protein phase coexistence: The 
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FIG. 2: The distribution of individual chains on the (N, R g ) plane in our second class of phase coexistent complexes. The data 

(2) 

clearly accumulates around the top line that describes the Regge trajectory Rg and the bottom line that describes a 6-point 
Regge trajectory with best fit values Ro = 1.234 and v — 0.508. 



interaction between the host and the guest provides a support that maintains each of them in an inherently unstable 
uncollapsed conformation. 

We have found two different classes of such phase coexistent oligomers in PDB. The first class consists of an 
apparently single protein but with multiple sub-chains that are in different phases. The present PDB codes for these 
proteins are 1WDC, 1G72, 1GOT, 1HTR, 1LTS, 2FP7, 2RIV, 3ABK, 3 ARC, 3CX5, 3DBO. Here we concentrate 
on the second class, formed by complexes with two or more a priori different proteins. The present PDB codes are 
1L2W, 1JDH, 1TH1, 2F8X, 2EPV, 2PRR, 2BFX, 2D7C, 2VGO, 2K8F, 2QKH, 3EGG, 3HTU, 3HPW, 31XS and 
3DXC (3DXD, 3DXE). fn Figure 2 we display the distribution of the individual sub-chains of the second class in the 
(N, R g ) plane, they clearly gather around a v s» 1/2 Regge trajectory and the trajectory in q2p - Biologically, the 
two most notable are 2K8F and 3DXC (3DXD, 3DXE). The former is a bound state of the "molecular interpreter" 
p300 [23] m the Regge trajectory R g , with the tumor suppressing protein p53 in the v 1/2 trajectory. The 
second is the one of interest here, the Alzheimer related AICD/Fe65 complex with AICD in the trajectory R\ ' and 
Fe65 in the v s=s 1/2 trajectory. We now proceed to analyze the peculiar physical properties of the Alzheimer related 
AICD of the second complex. 

In Figure 3a we display the C Q backbone Frenet frame bond and torsion angles of the AICD protein in 3DXC. This 
Figure reveals that the AICD consists of two very closely located loops that are separated from each other by a very 
short /3-strand. We can describe the profile of each of these angles using the soliton solution of nonlinear Schrodingcr 
equation, for the backbone bond angles we have [3T], [32] 

mi . e ci(i-s) _ m2 . e -c 2 (i-s) 
ipi — r — s r — > G M m °d (2%) (3) 

r gCl(j— s) _|_ g— c 2 (i— s) y ' v ' 

while the backbone torsion angles are computed in terms of the bond angles from 

9i = -\ 1 + ° >2 € [-7T, tt] mod (2tt) (4) 
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FIG. 3: a) The spectrum of backbone Frenet frame bond angles ipi (top line) and torsion angles 6i (bottom line) for the AICD 
component of 3DXC (chain B). b) The same spectrum after we have translated the first loop so that it becomes locked by the 
proline at site 669, as described in the text. We use PDB indexing for the sites. 



and the parameter values are listed in Table I: The corresponding (ipi,0i) profiles ([3]), Q describe the first loop at 
sites 676-683 (we use PDB indexing) with RMSD accuracy of 0.29 A and the second loop at sites 681-688 with RMSD 
accuracy of 0.17 A. Both accuracies are substantially better than the experimental B-factor accuracies. In Table I 

TABLE I: Parameter values for the two loops in Figure 2. 



loop 


mi m.2 ci C2 s match 


676-683 


51.517 51.766 2.984 2.983 679.909 177 


681-688 


39.274 38.617 3.327 3.347 682.174 896 



we also list the number of times each of these loops appear in PDB with RMSD accuracy 0.5 A or better. Both are 
abundant in the vsj1/3 Regge trajectory of collapsed proteins. In fact, at the outset there is nothing in the secondary 
structures of this AICD fold that appears unusual for a protein in the collapsed v ss 1/3 phase. Nevertheless it is very 

(2) 

accurately, almost exactly, located on the v w 1 Regge trajectory R g . 

Since the compactness index v is universal and can only have definite discrete values, any continuous and local 
deformation of the protein shape can never cause any kind of discontinuous transition such as a jump between the 
two phases v « 1 and v i=s 1/3. This makes the present combination of the two loops in AICD highly unusual. Even if 
we continuously translate the two loops apart from each other along the backbone by shifting the value s in ^ that 
determines the position of the center of the loop, we can never reach a collapsed Regge trajectory but will always 
remain in the v « 1 phase. 

A scrutiny of the amino acid structure reveals that AICD has a proline at site 669. Since proline often acts as an 
anchor of a loop in a protein in isolation, we propose that the presence of Fe65 prevents the first loop from sliding 
towards its natural position, where it becomes attached with Pro(669). Note that there is another proline at the site 
685 that appears to stabilize the position of the second loop. Using the explicit profile Q we investigate what 
might happen if the first soliton starts sliding towards Pro(669) along the backbone. For this we shift the value of the 
parameter s accordingly. In Figure 4 we show how the radius of gyration R g of the AICD depends on the position of 
the first loop as we slide it towards Pro(669) while keeping the second loop anchored by Pro(685); the final (ipi,0i) 
profile is displayed in Figure 3b. We find that R g increases monotonically when the two loops drift apart. When the 
first loop reaches the position where it becomes locked by Pro (669), the ensuing AICD configuration has relocated 

(2) (3) 

itself from the Regge trajectory R g ' to the Regge trajectory R g ' . Since it should be highly natural for a protein to 
always try and locate itself on a Regge trajectory, we propose that these are the two likely configurations of AICD in 
the complex with Fe65. The presence of two natural alternatives is an indicative of a genetic switching mechanism. It 
would be highly interesting to find out how the biological function of the AICD /Fe65 complex differs between these 
two unfolded conformations of AICD, when the complex becomes translocated to the nucleus and participates in gene 
transcription. Is there a correlation with the onset of Alzheimer's disease? Moreover, the genetic switch could even 
operate solely around Pro(669), the first soliton could conceivably be on either side of this proline. Suppose it is 
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FIG. 4: The evolution of the radius of gyration for the AICD in 3DXC (chain B), during the translation of its first loop to the 
proline at site 669; see also Figure 3b. 



located on the other side of Pro(669) when the 7/e cleavage takes place. It could then become part of A/3. Could this 
cause the formation of senile amyloid plaques? 

In isolation, the v « 1 phase of AICD must be extremely unstable under in vivo conditions, we have not found 
any single strand protein above our v = 3/5 Flory line. Since its two solitons are very common among v w 1/3 
proteins, we predict that an isolated AICD becomes subject to a phase transition that takes it into the collapsed 
v « 1/3 trajectory. Thus we propose that when the two proteins are disengaged, AICD collapses either in a process 
where the two loops first pair-annihilate each other and a new loop structure is formed to bring about the phase 
transition, or alternatively there could be the formation of a new loop near Pro(669). Alternatively, AICD enters a 
highly unstructured and dynamic state where the first loop bounces back and forth between the two prolines, causing 
AICD to oscillate between the two v ss 1 Regge trajectories. Other alternatives also exist. For example the first loop 
could become locked by Pro(669), and the relatively long /3-strand could then buckle to form a new loop. We propose 
xperiments are designed to find out the properties of AICD under various bad solvent conditions. 

We conclude that some of the proteins that are involved in the onset of Alzheimer's disease can be set apart by 
their rare physical properties. In particular the presence of a protein oligomer, and more generally a polymer complex, 
with a phase co-existence should be a challenge for future investigations. In particular, if it turns out that the origin 
of Alzheimer's disease is due to the ensuing instabilities. 
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